RAPID: A Reachable Anytime Planner for Imprecisely-sensed Domains
Authors
Abstract
Despite the intractability of generic optimal partially observable Markov decision process planning, there exist important problems that have highly structured models. Previous researchers have used this insight to construct more efficient algorithms for factored domains, and for domains with topological structure in the flat state dynamics model. In our work, motivated by findings from the education community relevant to automated tutoring, we consider problems that exhibit a form of topological structure in the factored dynamics model. Our Reachable Anytime Planner for Imprecisely-sensed Domains (RAPID) leverages this structure to efficiently compute a good initial envelope of reachable states under the optimal MDP policy in time linear in the number of state variables. RAPID performs partially-observable planning over the limited envelope of states, and slowly expands the state space considered as time allows. RAPID performs well on a large tutoring-inspired problem simulation with 122 state variables, corresponding to a flat state space of over 10^30 states.
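To make the idea of an "envelope of reachable states" concrete, here is a minimal sketch in Python. It is not the RAPID algorithm: it runs a generic breadth-first reachability search over a small flat MDP under a fixed policy, whereas the abstract describes a method that exploits factored structure to run in time linear in the number of state variables. The function name `reachable_envelope`, the dictionary-based model layout, and the toy states and actions are all hypothetical, chosen only for illustration.

```python
from collections import deque


def reachable_envelope(start_states, policy, transitions):
    """Return the set of states reachable from start_states under policy.

    policy: dict mapping state -> action (hypothetical layout)
    transitions: dict mapping (state, action) -> {next_state: probability}
    """
    envelope = set(start_states)
    frontier = deque(start_states)
    while frontier:
        state = frontier.popleft()
        action = policy.get(state)
        if action is None:
            # No action defined for this state: treat it as terminal.
            continue
        for next_state, prob in transitions.get((state, action), {}).items():
            # Only follow transitions that can actually occur.
            if prob > 0 and next_state not in envelope:
                envelope.add(next_state)
                frontier.append(next_state)
    return envelope


# Toy example: a state is a tuple of "skill mastered" flags (an assumption,
# loosely inspired by the tutoring setting mentioned in the abstract).
policy = {(0, 0): "teach_a", (1, 0): "teach_b"}
transitions = {
    ((0, 0), "teach_a"): {(1, 0): 0.7, (0, 0): 0.3},
    ((1, 0), "teach_b"): {(1, 1): 0.6, (1, 0): 0.4},
    ((0, 0), "teach_b"): {(0, 1): 0.1, (0, 0): 0.9},
}
envelope = reachable_envelope([(0, 0)], policy, transitions)
```

In this toy model the state `(0, 1)` is excluded from the envelope because the policy never chooses `teach_b` in `(0, 0)`. A planner restricted to the envelope would therefore reason over three states instead of four, and the saving grows quickly as the number of state variables increases.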
Similar resources
A Hybridized Planner for Stochastic Domains
Markov Decision Processes are a powerful framework for planning under uncertainty, but current algorithms have difficulties scaling to large problems. We present a novel probabilistic planner based on the notion of hybridizing two algorithms. In particular, we hybridize GPT, an exact MDP solver, with MBP, a planner that plans using a qualitative (nondeterministic) model of uncertainty. Whereas ...
An anytime approach for on-line planning
In this paper we present a novel planning approach, based on well-known techniques such as goal decomposition and heuristic planning, aimed at working in highly dynamic environments with time constraints. Our contribution is a domain-independent planner to incrementally generate plans under a deliberative framework for reactive domains. The planner follows the anytime principles, i.e. a first sol...
A Robotic Execution Framework for Online Probabilistic (Re)Planning
Due to the high complexity of probabilistic planning algorithms, roboticists often opt for deterministic replanning paradigms, which can quickly adapt the current plan to the environment’s changes. However, probabilistic planning suffers in practice from the common misconception that it is needed to generate complete or closed policies, which would not require to be adapted on-line. In this wor...
An On-Line Approach for Planning in Time-Limited Situations
In this paper we present a novel planning approach, based on well-known techniques such as goal decomposition and heuristic planning, aimed at working in highly dynamic environments with time constraints. Our contribution is a domain-independent planner to incrementally generate plans under a deliberative framework for reactive domains. The planner follows the anytime principles, i.e. a first sol...
Using Anytime Planning for Centralized Coordination of Multiple Robots in Real-Time Dynamic Environments
We are investigating the use of planning in multi-robot, real-time, dynamic environments. Each individual robot is considered to be an effector of a centralized planning system. In a real-time environment, it is important to be careful to ensure that the time required to find a good plan does not itself reduce the system’s ability to complete its tasks in a timely manner. Our physically situate...